CS224W: Methods of Parallelized Kronecker Graph Generation

نویسنده

  • Sean Choi
چکیده

The question of generating realistic graphs has always been a topic of huge interests. This topic has gained huge attention over the past few years with the advent of massive real-world network data that re generated by large software companies like Facebook and Google, along with the increase in the computation power that makes anyone capable of processing them. With real graphs at massive scale and parallelized frameworks to analyze them, network analysis became a major topic of scientific research. As the need to analyze these networks grew, the question of modeling and generating a real-world network graph at the same scale also became a topic of interest. Out of many approaches to model real-world networks, Stochastic Kronecker Graph (SKG) generation and its predecessor R-MAT generation have attracted interest in the network analysis community, due to their simplicity and their abilities to capture the properties of real-world networks. Along with such algorithms, a new programming methods to process large graphs called vertex-centric BSP with the implementations such as Pregel [3], Apache Giraph, GPS, and Apache Hama have become increasingly popular as an alternative to MapReduce and Hadoop, which are ill-suited to run massive scale graph algorithms [3]. The SKG, R-MAT and vertex-centric BSP, however, are not well-suited for each other. The obvious approach of parallelizing SKG, which is to generate edges in parallel, is not “vertexcentric” in nature and therefore is unnatural to program and runs inefficiently in vertex-centric

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

CS224W: Social and Information Network Analysis

In this project, we plan to explore the property of self-similarity exhibited by real world networks, and the use of Kronecker graphs to model and analyze such networks. It has been observed that self-similarity is an emergent property of many real world networks such as WWW, e-mail and biological networks. These networks show properties such as heavy tails for the inand out-degree distribution...

متن کامل

Rigorous Analysis of Kronecker Graphs and their Algorithms

Real world graphs have been observed to display a number of surprising properties. These properties include heavy-tails for inand out-degree distributions, small diameters, and a densification law [5]. These features do not arise from the classical Erdos-Renyi random graph model [1]. To address these difficulties, Kronecker Graphs were first introduced in [5] as a new method of generating graph...

متن کامل

Stochastic Kronecker Graph on Vertex-Centric BSP

Recently Stochastic Kronecker Graph (SKG), a network generation model, and vertex-centric BSP, a graph processing framework like Pregel, have attracted much attention in the network analysis community. Unfortunately the two are not very wellsuited for each other and thus an implementation of SKG on vertex-centric BSP must either be done serially or in an unnatural manner. In this paper, we pres...

متن کامل

Modeling Network Structure using Kronecker Multiplication∗

Given a large, real graph, how can we generate a synthetic graph that matches its properties, i.e., it has similar degree distribution, similar (small) diameter, similar spectrum, etc? We propose to use “Kronecker graphs”, which naturally obey all of the above properties. We present a fast linear time algorithm for fitting the Kronecker graph generation model to real networks. Experiments on la...

متن کامل

The Kronecker Theory of Power Law Graphs —DRAFT—

An analytical theory of power law graphs is presented based on the Kronecker graph generation technique. The analysis uses Kronecker exponentials of complete bipartite graphs to formulate the the substructure of such graphs. This allows various high level quantities (e.g. degree distribution, betweenness centrality, diameter, eigenvalues, and isoparametric ratio) to be computed directly from th...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2012